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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT (S) : HOGREFE , Holly 

HANSEN, Connie J 

(ii) TITLE OF INVENTION: Polymerase Enhancing Factor (PEF) 

Extracts, PEF Protein Complexes, 
Isolated PEF Proteins, and Methods for 
Purifying and Identifying Them 
(iii) NUMBER OF SEQUENCES: 89 
(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: M. Paul Barker, Esq., Finnegan, 
Henderson, Farabow, Garrett & Dunner, LLP 

(B) STREET: 1300 I. Street, N-W. 

(C) CITY: Washington 

(D) STATE: DC 
(F) ZIP : 20005 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 
(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: NOT YET KNOWN 

(B) FILING DATE: 20-MARCH-1998 

(C) CLASSIFICATION : 
(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: BARKER, M. PAUL 

<B) REGISTRATION NUMBER: 32,013 

(C) REFERENCE / DOCKET NUMBER: 4121.0116.02304 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202 408-4000 

(B) TELEFAX: 202 408-4400 
(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANT I - SENSE : NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Xaa Xaa Leu His His Val Lys Leu lie Tyr Ala Thr Xaa Xaa Xaa 
15 10 15 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 
(v) FRAGMENT TYPE: N- terminal 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Arg Xaa Glu Xaa Leu Xaa Xaa 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

Xaa Leu Leu His His Val Lys Leu lie Tyr Ala Thr Lys Xaa Arg Xaa 
15 10 15 

Leu Val Gly Lys Xaa lie Val Leu Ala lie Pro Gly Xaa Xaa Ala Xaa 
20 25 30 

Xaa Xaa Xaa 
35 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Airg Xaa Glu Xaa Leu Xaa Glu Xaa 
15 10 15 

Xaa Xaa 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Xaa Tyr Asp Ala Val lie Met Ala Ala Ala Val Val Asp Phe Arg Pro 
1 5 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
1 5 10 15 

Asn Gin Val Val Leu He Gly Arg 
20 

(2) .INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Gly Ala Met Leu His His Val Lys Leu He Tyr Ala Xaa Lys Leu Arg 
15 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

{ D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI - SENSE : NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Gly Ala Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Xaa Xaa 
15 10 15 

Arg Lys 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
Met Leu His His Val Lys Leu He Tyr Ala Thr Lys Leu 
15 10 
(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gly Xaa Xaa Xaa Pro Asp Trp Xaa Xaa Lys Phe Arg Lys Glu Glu Ser 
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! 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE : NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Gly Ala lie Leu Leu Pro Asp Trp Lys lie Arg Lys Glu lie Leu lie 
15 10 15 

Glu 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Xaa Met His His Val lie Lys Leu Xaa Tyr Ala Thr Xaa Ser Arg Lys 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N-terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 13 : 

Met Leu Tyr Leu Val Arg Pro Asp Trp Lys Arg Arg Lys Glu lie Leu 
15 10 15 

lie Glu 

(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAYCAYGAHA ARYTHATTTA CGC 23 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 
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te) STRANDEDNESS : single 
(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 15 : 
GCCATDATNA CDGCRTCGTA TTT 23 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI - SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CAYCAYGAHA ARYTHATATA CGC 23 
(2) INFORMATION FOR SEQ ID NO: 17: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ARDACDACYT GRTTTTCTTC 20 
(2) INFORMATION FOR SEQ ID NO: 18: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 09 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
ATGCTTCACC ACGTCAAGCT AATCTACGCC ACAAAAAGTC GAAAGCTAGT TGGAAAAAAG 60 
ATAGTCNNNN NNNNNCCAGG GAGTATTGCG GCTTTGGATG TGAAAGCTTG TGAGGGACTA 120 
ATTAGGCATG GGGCCGAAGT TCATGCAGTG ATGAGTGAGG CAGCCACCAA GATAATTCAT 180 
CCTTATGCAT GGAATTTGCC CACGGGAAAT CCAGTCATAA CTGAGATCAC TGGATTTATC 240 
GAGCATGTTG AGTTAGCAGG GGAACATGAG AATAAAGCAG ATTTAATTTT GGTTTGTCCT 300 
GCCACTGCCA ACACAATTAG TAAGATTGCA TGTGGAATAG ATGATACTCC AGTAACTACA 360 
GTCGTGACCA CAGCATTTCC CCACATTCCA ATTATGATAG CCCCAGCAAT GCATGAGACA 420 
ATGTACAGGC ATCCCATAGT AAGGGAGAAC ATTGAAAGGT TAAAGAAGCT TGGCGTTGAG 480 
TTTATAGGAC CAAGAATTGA GGAGGGAAAG GCAAAAGTTG CAAGCATTGA TGAAATAGTT 540 
TACAGAGTTA TTAAAAAGCT CCACAAAAAA ACATTGGAAG GGAAGAGAGT CCTAGTAACG 600 
GCGGGAGCAA CAAGAGAGTA CATAGATCCA ATAAGATTCA TAACAAATGC CAGCAGTGGA 660 
AAAATGGGAG TAGCGTTGGC TGAAGAAGCA GATTTTAGAG GAGCTGTTAC CCTCATAAGA 720 
ACAAAGGGAA GTGTAAAGGC TTTTAGAATC AGAAAAATCA AATTGAAGGT TGAGACAGTG 780 
GAAGAAATGC TTTCAGCGAT TGAAAATGAG TTGAGGAGTA AAAAGTATGA CGTAGTTATT 840 
ATGGCAGCTG CTGTAAGCGA TTTTAGGCCA AAAATTAAAG CAGAGGGAAA AATTAAAAGC 900 
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GG AAGATCAA TAACGATAGA GCTCGTTCCN NNNAATCCCA AAATCATTGA TAGAATAAAG 960 
GAAATTCAAC CAAATGTCTT TCTTGTTGGA TTTAAAGCAG AAACTTCAAA AGAAAAGCTT 1020 
ATAGAAGAAG GTAAAAGGCA GATTGAGAGG GCCAAGGCTG ACTTAGTCGT TGGTAACACA 1080 
TTGGAAGCCT TTGGAAGCGA GGAAAACCAA GTAGTATTAA TTGGCAGAGA TTTCACAAAA 1140 
GAACTTCCAA AAATGAAAAA GAGAGAGTTA GCAGAGAGAA TTTGGGATGA GATAGAGAAA 1200 
TTNCTGTCC 1209 
(2) INFORMATION FOR SEQ ID NO: 19: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 amino acids 

(B) TYPE : amino acid 

{ D ) TOPOIiOG Y : unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu His His Val Lys Leu lie Tyx Ala Thr Lys Ser Arg Lys Leu 

15 10 15 

Val Gly Lys Lys lie Val Xaa Xaa Xaa Pro Gly Ser lie Ala Ala Leu 

20 25 30 

Asp Val Lys Ala Cys Glu Gly Leu lie Arg His Gly Ala Glu Val His 

35 40 45 

Ala Val Met Ser Glu Ala Ala Thr Lys lie He His Pro Tyr Ala Trp 

50 55 .60 

Asn Leu Pro Thr Gly Asn Pro Val He Thr Glu He Thr Gly Phe lie 
65 70 75 80 

Glu His Val Glu Leu Ala Gly Glu His Glu Asn Lys Ala Asp Leu He 

85 90 95 

Leu Val Cys Pro Ala Thr Ala Asn Thr He Ser Lys He Ala Cys Gly 

100 105 110 

He Asp Asp Thr Pro Val Thr Thr Val Val Thr Thr Ala Phe Pro His 

115 120 125 

He Pro He Met He Ala Pro Ala Met His Glu Thr Met Tyr Arg His 

130 135 140 

Pro He Val Arg Glu Asn He Glu Arg Leu Lys Lys Leu Gly Val Glu 
145 150 155 160 

Phe He Gly Pro Arg He Glu Glu Gly Arg Ala Lys Val Ala Ser He 

165 170 175 

Asp Glu He Val Tyr Arg Val He Lys Lys Leu His Lys Lys Thr Leu 

180 185 190. 

Glu Gly Lys Arg Val Leu Val Thr Ala Gly Ala Thr Arg Glu Tyr He 

195 200 205 

Asp Pro He Arg Phe He Thr Asn Ala Ser Ser Gly Lys Met Gly Val 

210 215 220 

Ala Leu Ala Glu Glu Ala Asp Phe Arg Gly Ala Val Thr Leu He Arg 
225 230 235 240 

Thr Lys Gly Ser Val Lys Ala Phe Arg He Arg Lys He Lys Leu Lys 

245 250 255 

Val Glu Thr Val Glu Glu Met Leu Ser Ala He Glu Asn Glu Leu Arg 

260 265 270 

Ser Lys Lys Tyr Asp Val Val He Met Ala Ala Ala Val Ser Asp Phe 

275 280 285 

Arg Pro Lys He Lys Ala Glu Gly Lys He Lys Ser Gly Arg Ser He 

290 295 300 

Thr He Glu Leu Val Pro Xaa Asn Pro Lys He He Asp Arg He Lys 
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305 310 315 320 

Glu lie Gin Pro Asn Val Phe Leu Val Gly Phe Lys Ala Glu Thr Ser 

325 330 335 

Lys Glu Lys Leu lie Glu Glu Gly Lys Arg Gin lie Glu Arg Ala Lys 

340 345 350 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
355 360 365 

Asn Gin Val Val Leu He Gly Arg Asp Phe Thr Lys Glu Leu Pro Lys 

370 375 380 

Met Lys Lys Arg Glu Leu Ala Glu Arg He Trp Asp Glu He Glu Lys 
385 390 395 400 

Xaa Leu Ser 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 3 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CATAGCGAAT TCGCAAAACC TTTCGCGGTA TGG 33 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ACTACGGAAT TCCACGGAAA ATGCCGCTCA TCC 33 
(2) INFORMATION FOR SEQ ID NO: 22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
GGCGTTTCCG TTCTTCTTCG 20 
(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
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(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CCATCTCACG CGCCAGTTTC 20 
(2) INFORMATION FOR SEQ ID NO: 24: 
5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOIiOGY: unknown 
10 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE.: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GAGGAGAGCA GGAAAGGTGG AAC 23 
15 (2) INFORMATION FOR SEQ ID NO: 25: 

.3 (i) SEQUENCE CHARACTERISTICS: 

^ (A) LENGTH: 21 base pairs 

• Ts (B) TYPE: nucleic acid 

[% (C) STRANDEDNESS : single 

20]q (D) TOPOLOGY: unknown 

jd (ii) MOLECULE TYPE: DNA (genomic) 

'!« (iii) HYPOTHETICAL: NO 

I** (iv) ANTI -SENSE: YES 

lM (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

25; ^ GCTGGGAGAA GACTTCACTG G 21 

l ^ (2) INFORMATION FOR SEQ ID NO: 26: 

[D (i) SEQUENCE CHARACTERISTICS: 

iU (A) LENGTH: 19 base pairs 

^3 (B) TYPE: nucleic acid 

303 (C) STRANDEDNESS: single 

ifl (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

GAGCTTGCTC AACTTTATC 19 
(2) INFORMATION FOR SEQ ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 
40 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
45 (iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GATAGAGATA GTTTCTGGAG ACG 23 
(2) INFORMATION FOR SEQ ID NO: 28: 
(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 
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(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CGGGATATCG ACATTTCTGC ACC 
(2) INFORMATION FOR SEQ ID NO: 29: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid . 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CAGGACTCAG AAGCTGCTAT CGAA 
(2) INFORMATION FOR SEQ ID NO: 31: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTGCACGTGC CCTGTAGGAT TTGT 
(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
CCAGAYTGGA ARWKNAGGAA AGA 
(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



SUBSTITUTE SHEET (RULE 26) 



WO 98/42860 V PCT/US98/05497 



-86- 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

CCAGAYTGGA ARWKNAGAAA AGA 23 
(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

^0 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 

15 (iv) ANTI-SENSE: NO 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

ifi CCAGAYTGGA ARWKNAGGAA GGA 23 

id (2) INFORMATION FOR SEQ ID NO: 35: 

tjj (i) SEQUENCE CHARACTERISTICS: 

20-G (A) LENGTH: 23 base pairs 

i^j (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

|7| (D) TOPOLOGY : unknown 

"~ (ii) MOLECULE TYPE: DNA (genomic) 

25^ (iii) HYPOTHETICAL: NO 

! « (iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CCAGAYTGGA ARWKNAGAAA GGA 23 
j ^ (2) INFORMATION FOR SEQ ID NO: 36: 

30 y (i) SEQUENCE CHARACTERISTICS: 

■O (A) LENGTH: 84 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 
35 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CAGAGTGGGC AGAGAGGCTN TTGTTAAGGG GAAATTAATC GACGTGGAAA 
40 AGGAAGGAAA 60 

AGTCGNTATT CCTC CAAGGG AATA 84 
(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

45 (B) TYPE: amino acid 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 

(iv) ANTI-SENSE: NO 
50 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Glu Trp Ala Glu Arg Leu Leu Leu Arg Gly Asn Xaa Ser Lys Trp Lys 
! 5 10 15 
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Arg Lys Glu Lys Ser Xaa Phe Leu Gin Gly Asn 
20 25 
(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Arg Val Gly Arg Glu Ala Xaa Val Lys Gly Lys Leu lie Glu Val Glu 
x 5 10 15 

Lys Glu Gly Lys Val Xaa lie Pro Pro Arg Glu 
20 25 
(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 

(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Gin Ser Gly Gin Arg Gly Xaa Cys Xaa Gly Glu He Asn Arg Ser Gly 
x 5 10 15 

Lys Gly Arg Lys Ser Arg Tyr Ser Ser Lys Gly Leu 
20 25 
(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
CTGCCCACTC TGAGGTCATA ACCTGCTGGT TGGAGCCATT CTTCAGAAAA TGGCTCTATA 60 
AGTATTTCTT TTCTGATTTT CCAGTCTGGA AGTAGCATTT TACCACCGAA ACCTTTATTT 12 0 
TTAATTTAA 129 
(2) INFORMATION FOR SEQ ID NO: 41: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 42 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE : N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

Xaa lie Lys Asn Lys Gly Phe Gly Gly Lys Met Leu Leu Pro Asp Trp 
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15 10 15 

Lys lie Arg Lys Glu lie Leu lie Glu Pro Phe Ser Glu Glu Trp Leu 

20 25 30 

Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 
35 40 
(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TCCTCCAAGG GAATACGCCT TAATCCTAAC CCTCGAGAGG ATAAAGTTGC CCGACGATGT 60 
TATGGGGGAT ATGAAGATAA GGAGCAGTTT AGCAAGAGAA GGGGTTATTG GTTCTTTTGC 12 0 
TTGGGTTGAC CCAGGATGGG ATGGAAACTT AACACTAATG CTCTACAATG CCTCAAATGA 18 0 
ACCTGTCGAA TTAAGATATG GAGAGAGATT TGTGCAGATC GCATTTATAA GGCTAGAGGG 24 0 
TCCGGCAAGA AACCCTTACA GAGGAAACTA TCAGGGGAGC ACAAGGTTAG CGTTTTCAAA 3 00 
GAGAAAGAAA CTC T AGCGTC TTTTCAATAG CATCCTCAAT ATCTCGTGTG AAGTAATCAA 360 
TGTAAATACT TGCTGGGTGG GTTTTTAGGG ATTCAAACTC GTAAGATGGG CCTGTATAGC 42 0 
AGAAAACTAT TTTTGCCTCT TCTTCATTTA TCTTTCTGTG AATAAAAAAT CCAACATCCA 4 80 
CACTAGTTCC AAAAGATATT GTTTGCGTGA TTACCAACAA GATCTTGGCA TTATTTTTGA 54 0 
TCTTATACTC TATTCTCCTT TCTCCCTCCA ATTTGCCCAA AATAAACCTG GGTAGTATAC 600 
ATTCACTCCT CTCTTTTAAA TTCCTATAAA TTCGTACATA GTTTAGAAAA ATGTCAAATT 660 
CTTTNTTCCC TGTTAAATTA ACCNCNAAAT CTTTATNANN AANCTTTTTA TAATTCCCAA 720 
AACCC CTAAT TTTCCCCTTN 74 0 

(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 6 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: YES 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: N- terminal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

Leu Gin Gly Asn Thr Pro Xaa Ser Xaa Pro Ser Arg Gly Xaa Ser Cys 

15 10 15 

Pro Thr Met Leu Trp Gly lie Xaa Arg Xaa Gly Ala Val Xaa Gin Glu 

20 25 30 

Lys Gly Leu Leu Val Leu Leu Leu Gly Leu Thr Gin Asp Gly Met Glu 

35 40 45 

Thr Xaa His Xaa Cys Ser Thr Met Pro Gin Met Asn Leu Ser Asn Xaa 

50 55 60 

Asp Met Glu Arg Asp Leu Cys Arg Ser His Leu Xaa Gly Xaa Arg Val 
65 .70 75 80 

Arg Gin Glu Thr Leu Thr Glu Glu Thr lie Arg Gly Ala Gin Gly Xaa 

85 90 95 

Arg Phe Gin Arg Glu Arg Asn Ser Ser Val Phe Ser lie Ala Ser Ser 

100 105 110 

lie Ser Arg Val Lys Xaa Ser Met Xaa lie Leu Ala Gly Trp Val Phe 
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115 

Arg Asp Ser Asn Ser Xaa 
130 

Ala Ser Ser Ser Phe lie 
145 150 
Leu Val Pro Lys Asp lie 
165 

Leu Phe Leu He Leu Tyr 
180 

Lys He Asn Leu Gly Ser 
195 

Xaa He Arg Thr Xaa Phe 
210 

Lys Leu Thr Xaa Lys Ser 
225 230 
Pro Leu He Phe Pro Xaa 
245 

INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 



Pro 


Pro 


Arg 


Glu 


Tyr 


Ala 


Leu 


He 


Leu 


Thr 


Leu 


Glu 


Arg 


He 


Lys 


Leu 


1 








5 










10 










15 




Pro 


Asn 


Asn 


Val 


Met 


Gly 


Asp 


Met 


Lys 


He 


Arg 


Ser 


Ser 


Leu 


Ala 


Arg 








20 










25 










30 






Glu 


Gly 


Val 


He 


Gly 


Ser 


Phe 


Ala 


Trp 


Val 


Asp 


Pro 


Gly 


Trp 


Asp 


Gly 






35 










40 










45 








Asn 


Leu 


Thr 


Leu 


Met 


Leu 


Tyr 


Asn 


Ala 


Ser 


Asn 


Glu 


Pro 


Val 


Glu 


Leu 




50 










55 










60 










Arg 


Tyr 


Gly 


Glu 


Arg 


Phe 


Val 


Gin 


He 


Ala 


Phe 


He 


Arg 


Leu 


Glu 


Gly 


65 










70 










75 










80 


Pro 


Ala 


Arg 


Asn 


Pro 


Tyr 


Arg 


Gly 


Asn 


Tyr 


Gin 


Gly 


Ser 


Thr 


Arg 


Leu 










85 










90 










95 




Ala 


Phe 


Ser 


Lys 


Arg 


Lys 


Lys 


Leu 


Xaa 


Arg 


Leu 


Phe 


Asn 


Ser 


He 


Leu 








100 










105 










110 






Asn 


He 


Ser 


Cys 


Glu 


Val 


He 


Asn 


Val 


Asn 


Thr 


Cys 


Trp 


Val 


Gly 


Phe 






115 










120 










125 








Xaa 


Gly 


Phe 


Lys 


Leu 


Val 


Arg 


Trp 


Ala 


Cys 


He 


Ala 


Glu 


Asn 


Tyr 


Phe 




130 










135 










140 










Cys 


Leu 


Phe 


Phe 


He 


Tyr 


Leu 


Ser 


Val 


Asn 


Lys 


Lys 


Ser 


Asn 


He 


His 


145 










150 










155 










160 


Thr 


Ser 


Ser 


Lys 


Arg 


Tyr 


Cys 


Leu 


Arg 


Asp 


Tyr 


Gin 


Gin 


Asp 


Leu 


Gly 










165 










170 










175 




He 


He 


Phe 


Asp 


Leu 


He 


Leu 


Tyr 


Ser 


Pro 


Phe 


Ser 


Leu 


Gin 


Phe 


Ala 








180 










185 










190 






Gin 


Asn 


Lys 


Pro 


Gly 


Xaa 


Tyr 


Thr 


Phe 


Thr 


Pro 


Leu 


Phe 


Xaa 


He 


Pro 






195 










200 










205 








He 


Asn 


Ser 


Tyr 


He 


Val 


Xaa 


Lys 


Asn 


Val 


Lys 


Phe 


Phe 


Xaa 


Pro 


Cys 



210 215 220 



120 

Asp Gly Pro Val Xaa 
135 

Phe Leu Xaa He Lys 
155 

Val Cys Val He Thr 
170 

Ser He Leu Leu Ser 
185 

He His Ser Leu Leu 
200 

Arg Lys Met Ser Asn 
215 

Leu Xaa Xaa Xaa Phe 
235 



125 

Gin Lys Thr He Phe 
140 

Asn Pro Thr Ser Thr 
160 

Asn Lys He Leu Ala 
175 

Pro Ser Asn Leu Pro 
190 

Ser Phe Lys Phe Leu 
205 

Ser Xaa Phe Pro Val 
220 

Leu Xaa Phe Pro Lys 
240 
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Xaa lie Asn Xaa Xaa lie Phe Xaa Xaa Xaa Leu Phe lie lie Pro Lys 
225 230 235 240 

Thr Pro Asn Phe Pro Leu 
245 

(2> INFORMATION FOR SEQ ID NO: 45: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 246 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



Ser 


Ser 


Lys 


Gly 


He 


Arg 


Leu 


Asn 


Pro 


Asn 


Pro 


Arg 


Glu 


Asp 


Lys 


Val 


1 








5 










10 










15 




Ala 


Arg 


Arg 


Cys 


Tyr 


Gly 


Gly 


Tyr 


Glu 


Asp 


Lys 


Glu 


Gin 


Phe 


Ser 


Lys 








20 










25 










30 






Arg 


Arg 


Gly 


Tyr 


Trp 


Phe 


Phe 


Cys 


Leu 


Gly 


Xaa 


Pro 


Arg 


Met 


Gly Trp 






35 










40 










45 








Lys 


Leu 


Asn 


Thr 


Asn 


Ala 


Leu 


Gin 


Cys 


Leu 


Lys 


Xaa 


Thr 


Cys 


Arg 


He 




50 










55 










60 










Lys 


lie 


Trp 


Arg 


Glu 


He 


Cys 


Ala 


Asp 


Arg 


He 


Tyr 


Lys 


Ala 


Arg 


Gly 


65 










70 










75 










80 


Ser 


Gly 


Lys 


Lys 


Pro 


Leu 


Gin 


Arg 


Lys 


Leu 


Ser Gly 


Glu 


His 


Lys 


Val 










85 










90 










95 




Ser 


Val 


Phe 


Lys 


Glu 


Lys 


Glu 


Thr 


Leu 


Ala 


Ser 


Phe 


Gin 


Xaa 


His 


Pro 








100 










105 










110 






Gin 


Tyr 


Leu 


Val 


Xaa 


Ser 


Asn 


Gin 


Cys 


Lys 


Tyr 


Leu 


Leu 


Gly Gly 


Phe 






115 










120 










125 








Leu 


Gly 


He 


Gin 


Thr 


Arg 


Lys 


Met 


Gly 


Leu 


Tyr 


Ser 


Arg 


Lys 


Leu 


Phe 




130 










135 










14 0 










Leu 


Pro 


Leu 


Leu 


His 


Leu 


Ser 


Phe 


Cys 


Glu 


Xaa 


Lys 


He 


Gin 


His 


Pro 


145 










150 










155 










160 


His 


Xaa 


Phe 


Gin 


Lys 


He 


Leu 


Phe 


Ala 


Xaa 


Leu 


Pro 


Thr 


Arg 


Ser 


Trp 










165 










170 










175 




His 


Tyr 


Phe 


Xaa 


Ser 


Tyr 


Thr 


Leu 


Phe 


Ser 


Phe 


Leu 


Pro 


Pro 


He 


Cys 








180 










185 










190 






Pro 


Lys 


Xaa 


Thr 


Trp 


Val 


Val 


Tyr 


He 


His 


Ser 


Ser 


Leu 


Leu 


Asn 


Ser 






195 










200 










205 








Tyr 


Lys 


Phe 


Val 


His 


Ser 


Leu 


Glu 


Lys 


Cys 


Gin 


He 


Leu 


Xaa 


Ser 


Leu 




210 










215 










220 










Leu 


Asn 


Xaa 


Pro 


Xaa 


Asn 


Leu 


Tyr 


Xaa 


Xaa 


Xaa 


Phe 


Tyr 


Asn 


Ser 


Gin 


225 










230 










235 










240 


Asn 


Pro 


Xaa 


Phe 


Ser 


Pro 























245 

(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

( D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: N-terminal 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO:46: 

Met Leu His His Val Lys Leu lie Tyr Ala Thr Lys Ser Arg Lys Leu 
1 5 10 15 

Val Gly Lys Lys lie Val Xaa Xaa Xaa Pro Gly Ser lie Ala Ala 
20 25 30 

(2) INFORMATION FOR SEQ ID NO: 47: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS ; unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Lys Tyr Asp Val Val lie Met Ala Ala Ala Val Ser Asp Phe Arg Phe 
15 10 15 

Lys 

(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

Ala Asp Leu Val Val Gly Asn Thr Leu Glu Ala Phe Gly Ser Glu Glu 
15 10 15 

Asn Gin Val Val Leu lie Gly Arg 
20 

(2) INFORMATION FOR SEQ ID NO: 49: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 
CTATTGAGTA CGAACGCCAT C 21 
(2) INFORMATION FOR SEQ ID NO: 50: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
GTCACGCTTG CTCCACTCCG 20 
(2) INFORMATION FOR SEQ ID NO: 51: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanococcus Jannaschii 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

Met lie Ser Glu lie Met His Pro Thr Lys Leu Leu Lys Gly Thr Lys . 

1 5 10 15 

Ser Lys Leu Leu Glu Asn Lys Lys lie Leu Val Ala Val Thr Ser Ser 

20 25 30 

lie Ala Ala lie Glu Thr Pro Lys Leu Met Arg Glu Leu He Arg His 

35 40 45 

Gly Ala Glu Val Tyr Cys He He Thr Glu Glu Thr Lys Lys He He 

50 55 60 

Gly Lys Glu Ala Leu Lys Phe Gly Cys Gly Asn Glu Val Tyr Glu Glu 
65 70 75 80 

He Thr Gly Xaa Xaa Xaa Xaa Xaa Asp He Glu His He Leu Leu Tyr 

85 90 95 

Xaa Xaa Xaa Xaa Asn Glu Cys Asp Cys Leu Leu He Tyr Pro Ala Thr 

100 105 HO 

Ala Asn He He Ser Lys He Asn Leu Gly He Ala Asp Asn He Val 

115 120 125 

Asn Thr Thr Ala Leu Met Phe Phe Gly Asn Lys Pro He Phe He Val 

130 135 140 

Pro Ala Met His Glu Asn Met Phe Asn Xaa Xaa Ala He Lys Arg His 
145 150 155 160 

He Asp Lys Leu Lys Glu Lys Asp Lys He Tyr He He Ser Pro Lys 

165 170 175 

Phe Glu Glu Xaa Xaa Xaa Xaa Xaa Xaa Gly Lys Ala Lys Val Ala Asn 

180 185 190 

He Glu Asp Val Val Lys Ala Val He Glu Lys He Gly Asn Asn Leu 

195 200 205 

Lys Lys Glu Gly Asn Arg Val Leu He Leu Asn Gly Gly Thr Val Glu 

210 215 220 

Phe He Asp Lys Val Arg Val He Ser Asn Leu Ser Ser Gly Lys Met 
225 230 235 240 

Gly Val Ala Leu Ala Glu Ala Phe Cys Lys Glu Gly Phe Tyr Val Glu 

245 250 255 

Val He Thr Ala Met Gly Leu Glu Pro Pro Tyr Tyr He Lys Asn His 

260 265 270 

Lys Val Leu Thr* Ala Lys Glu Met Leu Asn Lys Ala He Glu Xaa Xaa 

275 280 285 

Leu Xaa Ala Lys Asp Phe Asp He He He Ser Ser Ala Ala He Ser 

290 295 300 

Asp Phe Thr Val Glu Ser Xaa Phe Glu Gly Lys Leu Ser Ser Glu Glu 
305 310 315 320 

Glu Xaa Xaa Xaa Xaa Leu He Leu Lys Leu Lys Arg Xaa Asn Pro Lys 
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325 










330 








335 




Val 


Leu 


Glu 


Glu 


Leu 


Arg 


Arg 


He 


Tyr 


Lys 


Asp 


Xaa Lys 


Val 


He 


He 








340 










345 








350 






Gly 


Phe 


Lys 


Ala 


Glu 


Tyr 


Asn 


Leu 


Asp 


Glu 


Lys 


Glu Leu 


He 


Asn 


Arg 






355 










360 








365 








Ala 


Lys 


Glu 


Arg 


Leu 


Asn 


Lys 


Tyr 


Asn 


Leu 


Asn 


Met He 


He 


Ala 


Asn 




370 










375 










380 








Asp 


Leu 


Ser 


Lys 


Xaa 


Xaa 


His 


Tyr 


Phe 


Gly Asp Asp Tyr 


He 


Glu 


Val 


385 










390 










395 








400 


Tyr 


lie 


lie 


Thr 


Lys 


Tyr 


Glu 


Val 


Glu 


Lys 


He 


Ser Gly 


Ser 


Lys 


Lys 










405 










410 








415 




Xaa 


Glu 


He 


Ser 


Glu 


Arg 


He 


Val 


Glu 


Lys 


Val 


Lys Lys 


Leu 


Val 


Lys 








420 










425 








430 






Ser 


Xaa 


Xaa 


Xaa 


Xaa 























10 



15 435 

(2) INFORMATION FOR SEQ ID NO: 52: 
! 2 (i) SEQUENCE CHARACTERISTICS: 

''M (A) LENGTH: 444 amino acids 

^ (B) TYPE: amino acid 

20 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: protein 
j 3 (iii) HYPOTHETICAL: NO 

! 3 (iv) ANTI-SENSE: NO 

W (vi) ORIGINAL SOURCE: 

25 (A) ORGANISM: Escherichia coli 

□ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

'Q Met Lys Ala Arg Gin Gin Lys Tyr Cys Asp Lys He Ala Asn Phe Trp 

m 1 5 10 15 

i3 Cys His Pro Thr Gly Lys He He Met Ser Leu Ala Gly Lys Lys He 

30 20 25 30 

Val Leu Gly Val Ser Gly Gly He Ala Ala Tyr Lys Thr Pro Glu Leu 

35 40 45 

Val Arg Arg Leu Arg Asp Arg Gly Ala Asp Val Arg Val Ala Met Thr 
50 55 60 

35 Glu Ala Ala Lys Ala Phe He Thr Pro Leu Ser Leu Gin Ala Val Ser 

65 70 75 80 

Gly Tyr Pro Val Ser Asp Ser Leu Leu Asp Pro Ala Ala Glu Ala Ala 

85 90 95 

Met Gly His He Glu Leu Gly Xaa Xaa Xaa Xaa Lys Trp Ala Asp Leu 
40 100 105 110 

Val He Leu Ala Pro Ala Thr Ala Asp Leu He Ala Arg Val Ala Ala 

115 120 125 

Gly Met Ala Asn Asp Leu Val Ser Thr He Cys Leu Ala Thr Pro Xaa 
130 135 140 

45 Xaa Ala Pro Val Ala Val Leu Pro Ala Met Asn Gin Gin Met Tyr Arg 

145 150 155 160 

Ala Ala Ala Thr Gin His Asn Leu Glu Val Leu Ala Xaa Ser Arg Gly 

165 170 175 

Leu Leu lie Trp Gly Pro Asp Ser Gly Ser Gin Ala Cys Gly Asp He 
50 180 185 190 

Gly Pro Gly Arg Xaa Xaa Asp Pro Leu Thr He Val Asp Met Ala Val 

195 200 205 

Ala His Phe Ser Pro Val Asn Asp Leu Lys His Leu Asn He Met He 
210 215 220 
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Thr Ala Gly Pro Thr Arg Glu Pro Leu Asp Pro Val Arg Tyr lie Ser 
225 230 235 240 

Asn His Ser Ser Gly Lys Met Gly Phe Ala lie Ala Ala Ala Ala Ala 
245 250 255 

5 Arg Arg Gly Ala Asn Val Thr Leu Val Ser Gly Pro Val Ser Leu Pro 

260 265 270 

Thr Pro Pro Phe Val Lys Arg Val Asp Val Met Thr Ala Leu Glu Met 

275 280 285 

Glu Ala Ala Val Asn Xaa Xaa Ala Ser Val Gin Gin Gin Asn lie Phe 
10 290 295 300 

lie Gly Cys Ala Ala Val Ala Asp Tyr Arg Ala Ala Thr Val Ala Pro 
305 310 315 320 

Glu Lys lie Lys Lys Gin Ala Thr Gin Gly Asp Glu Leu Thr He Lys 
325 330 335 

15 Met Val Lys Xaa Asn Pro Asp He Val Ala Gly Val Ala Ala Leu Lys 

340 345 350 

Asp His Arg Pro Tyr Val Val Gly Phe Ala Ala Glu Thr Asn Asn Xaa 

355 360 365 

Xaa Xaa Xaa Val Glu Glu Tyr Ala Arg Gin Lys Arg He Arg Lys Asn 
26^ 370 375 380 

iE; Leu Asp Leu He Cys Ala Asn Asp Val Ser Gin Pro Thr Gin Gly Phe 

IS 385 390 395 400 

Asn Ser Asp Asn Asn Ala Leu His Leu Phe Trp Gin Asp Gly Asp Lys 
^ 405 410 415 

25^ Val Leu Pro Leu Glu Arg Lys Glu Leu Leu Gly Gin Leu Leu Leu Asp 

^ 420 425 430 

S Glu He Val Thr Arg Tyr Asp Glu Lys Asn Arg Arg 

|y 435 440 

□ (2) INFORMATION FOR SEQ ID NO: 53: 

3Q0 (i) SEQUENCE CHARACTERISTICS: 

tjg (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
35 <iii) HYPOTHETICAL: YES 

(iv) ANTI- SENSE: NO 

(v) FRAGMENT TYPE: internal 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 
40 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 54: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL : NO 
(iv) ANTI - SENSE : NO 
(v) FRAGMENT TYPE: internal 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 
15 10 
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(2) INFORMATION FOR SEQ ID NO: 55: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 

Ala Gly Trp lie Asp Ala Gly Phe Lys Gly Lys He Thr Leu 
15 10 
(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 
(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Pro Thr lie Val Asp Ala Gly Phe Glu Gly Gin Leu Thr lie 
1 5. 10 

(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

. Ala His Arg lie Asp Pro Gly Trp Ser Gly Cys lie Val Leu 
15 10 
(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GAGTTAAATG CCTACACTGT ATCT 
(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 
CAGGACTCAG AAGCTG CTAT CGAA 
(2) INFORMATION FOR SEQ ID NO: 61: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CTGCACGTGC CCTGTAGGAT TTGT 
2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CTATTGAGTA CGAACGCCAT C 
(2) INFORMATION FOR SEQ ID NO: 63: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 20 base pairs 

(B) , TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 63 : 
GTCACGCTTG CTCCACTCCG 
(2) INFORMATION FOR SEQ ID NO: 64: 
( i ) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
G AGG AG AG CA GGAAAGGTGG AAC 
(2) INFORMATION FOR SEQ ID NO: 65: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GCTGGGAGAA GACTTCACTG G 
(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
His His Val Lys Leu lie Tyr Ala 
1 5 
(2) INFORMATION FOR SEQ ID NO : 67 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
Lys Tyr Asp Ala Val lie Met Ala 
1 5 
(2) INFORMATION FOR SEQ ID NO: 68: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 
Glu Glu Asn Gin Val Val Leu 
1 5 
(2) INFORMATION FOR SEQ ID NO: 69: 
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(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNES S : unknown 

(D) TOPOLOGY : unknown 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
Pro Asp Trp Lys lie Arg Lys Glu 
1 5 
(2) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
ATGCTACTTC CAGACTGGAA AATCAGAAAA GAAATACTTA TAGAGCCATT TTCTGAAGAA 60 
TCGCTCCAAC CAGCAGGTTA TGACCTCAGA GTGGGCAGAG AGGCTTTTGT TAAGGGGAAA 12 0 
TTAATCGACG TGGAAAAGGA AGGAAAAGTC GTTATTCCTC CAAGGGAATA CGCCTTAATC 180 
CTAACCCTCG AGAGGATAAA GTTGCCCGAC GATGTTATGG GGGATATGAA GATAAGGAGC 24 0 
AGTTTAGCAA GAGAAGGGGT TATTGGTTCT TTTGCTTGGG TTGACCCAGG ATGGGATGGA 300 
AACTTAACAC TAATGCTCTA CAATGCCTCA AATGAACCTG TCGAATTAAG ATATGGAGAG 360 
AGATTTGTGC AGATCGCATT TATAAGGCTA GAGGGTCCGG CAAGAAACCC TTACAGAGGA 42 0 
AACTATCAGG GGAGCACAAG GTTAGCGTTT TCAAAGAGAA AGAAACTCTA G 471 
(2) INFORMATION FOR SEQ ID NO: 71: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 156 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY: unknown 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

Met Leu Leu Pro Asp Trp Lys lie Arg Lys Glu lie Leu lie Glu Pro 

15 10 15 

Phe Ser Glu Glu Ser Leu Gin Pro Ala Gly Tyr Asp Leu Arg Val Gly 

20 25 30 

Arg Glu Ala Phe Val Lys Gly Lys Leu lie Asp Val Glu Lys Glu Gly 

35 40 45 

Lys Val Val lie Pro Pro Arg Glu Tyr Ala Leu lie Leu Thr Leu Glu 

50 55 60 

Arg lie Lys Leu Pro Asp Asp Val Met Gly Asp Met Lys lie Arg Ser 
65 70 75 80 

Ser Leu Ala Arg Glu Gly Val lie Gly Ser Phe Ala Trp Val Asp Pro 

85 90 95 

Gly Trp Asp Gly Asn Leu Thr Leu Met Leu Tyr Asn Ala Ser Asn Glu 
100 105 110 
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Pro Val Glu Leu Arg Tyr Gly Glu Arg Phe Val Gin He Ala Phe He 

115 120 125 

Arg Leu Glu Gly Pro Ala Arg Asn Pro Tyr Arg Gly Asn Tyr Gin Gly 
130 135 140 

5 Ser Thr Arg Leu Ala Phe Ser Lys Arg Lys Lys Leu 

145 150 155 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

10 (B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: YES 

15 (iv) ANTI-SENSE: NO 

,«i (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Xaa Gly Xaa Xaa Asp Xaa Xaa Xaa Gly Xaa Xaa Xaa Xaa 

itt 1 5 10 

^ (2) INFORMATION FOR SEQ ID NO: 73: 

2fri (i) SEQUENCE CHARACTERISTICS: 

(A)- LENGTH: 14 amino acids 

^ (B) TYPE: amino acid 

|«f (C) STRANDEDNESS : unknown 

: ^ (D) TOPOLOGY: unknown 

25^ (ii) MOLECULE TYPE: peptide 

j 3 (iii) HYPOTHETICAL: NO 

=D (iv) ANTI -SENSE: NO 

|y (v) FRAGMENT TYPE: internal 

Q (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

3<3J Phe Ala Trp Val Asp Pro Gly Trp Asp Gly Asn Thr Leu Met 

- t Q 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

35 (B) TYPE: amino acid 

( c ) STRAND EDNES S : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
40 (iv) ANTI -SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Ala Gly Trp lie Asp Ala Gly Phe Lys Gly Lys lie Thr Leu 
15 10 
45 (2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
50 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
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(v) FRAGMENT TYPE: internal 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
Ser Ala Val His Asp Pro Gly Tyr Glu Gly Arg Pro Glu Tyr 
15 10 
5 (2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
10 (D) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
(v) FRAGMENT TYPE: internal 

1EK (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

'S Pro Thr lie Val Asp Ala Gly Phe Glu Gly Gin Leu Thr lie 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 



202 (A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNES S : unknown 

(D) TOPOLOGY: unknown 
(ii) MOLECULE TYPE: peptide 

253 (iii) HYPOTHETICAL: NO 

; fcJ (iv) ANTI -SENSE: NO 

lU (v) FRAGMENT TYPE: internal 

Q - (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

iJ3 Ala His Arg He Asp Pro Gly Trp Ser Gly Cys He Val Leu 

30j3 l 5 10 

(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

35 (C) STRANDEDNESS: unknown 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
40 (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

Val Gly Leu He Asp Ser Asp Tyr Gin Gly Gin Leu Met He 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 79: 
45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 
{ D ) TOPOLOGY : unknown 

50 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Ala Gly Val Val Asp Arg Asp Tyr Thr Gly Glu Val Lys Val 
15 10 
(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRAND EDNESS : unknown 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

<iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 80: 

Ala Gly Val lie Asp Glu Asp Tyr Arg Gly Asn Val Gly Val 
15 10 
(2) INFORMATION FOR SEQ ID NO: 81: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : unknown 

(D) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL : NO 
(iv) ANTI-SENSE: NO 

(v) FRAGMENT TYPE: internal 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Thr Gly Leu lie Asp Pro Gly Phe Gin Gly Glu Leu Lys Leu 
15 10 
(2) INFORMATION FOR SEQ ID NO: 82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
GACGACGACA AGATGCTACT TCCAGACTGG AAA 
(2) INFORMATION FOR SEQ ID NO:B3: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 



(ii) MOLECULE TYPE : DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 
GGAACAAGAC CCGTCCCACT TTCACAGATG AAGAG 35 
(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : unknown 

5 (ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 
GAGGAGAGCA GGAAAGGTGG AAC 23 
10 (2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
[% (iii) HYPOTHETICAL: NO 

l M (iv) ANTI - SENSE : YES 

; i? (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

20 CTCCATGTCC CAACTCCGAT CAC 2 3 

(2) INFORMATION FOR SEQ ID NO: 86: 

^ (i) SEQUENCE CHARACTERISTICS : 

j«f (A) LENGTH: 3 8 base pairs 

l -M ~ (B) TYPE: nucleic acid 

25 _ x (C) STRANDEDNESS: single 

Q (D) TOPOLOGY: unknown 

5 J3 (ii) MOLECULE TYPE: DNA (genomic) 

jlj (iii) HYPOTHETICAL: NO 

□ (iv) ANTI-SENSE: NO 

30 iQ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86: 

iQ GGTTTTCCCA GTCACGACGT TGTAAAACGA CGGCCAGT 3 8 

(2) INFORMATION FOR SEQ ID NO: 87: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
40 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87: 
GGUUUUCCCA GUCACGACGU UGUAAAACGA CGGCCAGU 38 
(2) INFORMATION FOR SEQ ID NO: 88: 
(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

50 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 
GACGACGACA AGATGCCCTG CTCTGAAGAG ACACC 35 
(2) INFORMATION FOR SEQ ID NO: 89: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : unknown 
(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 
GGAACAAGAC CCGTTTAATT CTTTCCAGTG GAACC 35 
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